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Figure 2 



• Sequences and annotations collect from a variety of sources. 

v Sequence Analysis ; 

• Aligned to the draft assembly of the genome. 

• Low quality bases trimmed from ESTs. 

• Polyadenylation sites detected. 

.\ Seed Cluster Creation 

• Seed clusters created for each UniGene Cluster. 

• Additional seed clusters are created with potential full-length 
mRNAs not in UniGene. 

• Seed clusters are subclustered based on the contig. 

• Sequences are subclustered in transcriptome space. 

QnpitMidn ;Based Subeluslping 

• EST read directions, CDS annotations, consensus splice sites, 
and polyadenylation sites are used to predict the orientation of 

the subcluster. The subcluster is resubclustered if substantial conflicts exist. 

Picking Probe ^eleptidil Riegioiis ; 

• For each subcluster, regions for probe selection are selected 
from either the consensus or exemplar sequences. 

• Multiple regions maybe choosen due to alternative polyadenylation sites. 

Prioritizing Sequence Selection Regions 

• Regions for probe selection are prioritized based on the quality 
of sequence and annotation. 
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